3574 results found.
Written
Corpus
Language Type:
Multilingual
Languages:
English German French Italian
Availability:
Freely Available
License:
Open source for annotations; license for source text as stated in the paper
Size:
20 000 000 words
Production Status:
Newly created-finished
Use:
Machine Translation, Speech-to-Speech Translation
- Paper title: SwissAdmin: A multilingual tagged parallel corpus of press releases
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Yves Scherrer | LATL-CUI, Université de Genève | FI |
| Author 2 | Luka Nerima | LATL-CUI, Université de Genève | CH |
| Author 3 | Lorenza Russo | LATL-CUI, Université de Genève | CH |
| Author 4 | Maria Ivanova | LATL-CUI, Université de Genève | CH |
| Author 5 | Eric Wehrli | LATL-CUI, Université de Genève | CH |
| Main Contact | Yves Scherrer | University of Helsinki | None |
Documentation:
The paper itself documents the corpus.
Language Type:
Multilingual
Languages:
English German Spanish French Italian
Availability:
Freely Available
License:
<Not Specified>
Size:
200000 entries
Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
- Paper title: A Corpus for Multilingual Document Classification in Eight Languages
- Paper track: Evaluation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Holger Schwenk | Facebook AI Research | FR |
| Author 2 | Xian Li | | US |
| Main Contact | Holger Schwenk | Facebook AI Research | None |
Documentation:
<Not Specified>
Written
Lexicon
Language Type:
Multilingual
Languages:
English German Mandarin Chinese Slovenian Spanish
Availability:
Freely Available
License:
<Not Specified>
Size:
25 GB
Production Status:
Newly created-finished
Use:
Semantic Web
- Paper title: xLiD-Lexica: Cross-lingual Linked Data Lexica
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Lei Zhang | Karlsruhe Institute of Technology | DE |
| Author 2 | Michael Färber | Karlsruhe Institute of Technology | DE |
| Author 3 | Achim Rettinger | Karlsruhe Institute of Technology | DE |
| Main Contact | Lei Zhang | Karlsruhe Institute of Technology | None |
Documentation:
<Not Specified>
Written
Corpus
Language Type:
Multilingual
Languages:
English Hindi Punjabi Tamil
Availability:
From Data Center(s)
License:
TDIL, Government of India
Size:
600000 sentences
Production Status:
Existing-used
Use:
Machine Translation, Speech-to-Speech Translation
- Paper title: Issues in chunking parallel corpora: mapping Hindi-English verb group in ILCI
- Paper track: Short Paper
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Esha Banerjee | Jawaharlal Nehru University | IN |
| Author 2 | Akanksha Bansal | Jawaharlal Nehru University | None |
| Author 3 | Girish Jha | Jawaharlal Nehru University, New Delhi | IN |
| Main Contact | Esha Banerjee | Jawaharlal Nehru University | None |
Documentation:
<Not Specified>
Multimodal/Multimedia
Ontology
Language Type:
Multilingual
Languages:
English Mandarin Chinese Spanish Italian
Availability:
Freely available web interface
License:
Not yet defined
Size:
1017
Production Status:
Newly created-finished
Use:
Word Sense Disambiguation
- Paper title: The IMAGACT Visual Ontology. An Extendable Multilingual Infrastructure for the representation of lexical encoding of Action
- Paper track: Infrastructural Issues/Large Projects
- Paper status: Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Massimo Moneglia | University of Florence | IT |
| Author 2 | Susan Brown | University of Florence | IT |
| Author 3 | Francesca Frontini | ILC-CNR | IT |
| Author 4 | Gloria Gagliardi | University of Florence | IT |
| Author 5 | Fahad Khan | ILC-CNR | IT |
| Author 6 | Monica Monachini | Institute of Computational Linguistics - CNR | IT |
| Author 7 | Alessandro Panunzi | University of Florence | IT |
| Main Contact | Massimo Moneglia | University of Florence | None |
Documentation:
Public documentation in English
Language Type:
Multilingual
Languages:
English German Portuguese Russian Turkish
Availability:
Not Available
License:
-
Size:
38000 words
Production Status:
Newly created-in progress
Use:
Discourse
- Paper title: Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Deniz Zeyrek | Middle East Technical University | TR |
| Author 2 | Amália Mendes | Centre for Linguistics of the University of Lisbon | PT |
| Author 3 | Murathan Kurfalı | Middle East Technical University | TR |
| Main Contact | Deniz Zeyrek | Middle East Technical University | None |
Documentation:
An annotation manual in English exists; it is currently available only to the annotators.
Language Type:
Multilingual
Languages:
English Iranian Persian Standard Arabic Urdu
Availability:
<Not Specified>
License:
<Not Specified>
Size:
Very big Other
Production Status:
Collected for a year so far; collection will continue until the start of the LREC conference
Use:
Corpus Creation/Annotation
- Paper title: Creation of comparable corpora for English-{Urdu, Arabic, Persian}
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Murad Abouammoh | King Saud University | SA |
| Author 2 | Kashif Shah | University of Sheffield | GB |
| Author 3 | Ahmet Aker | University of Sheffield | GB |
| Main Contact | Ahmet Aker | University of Sheffield | None |
Documentation:
<Not Specified>
Multimodal/Multimedia
Repository
Language Type:
Multilingual
Languages:
English German German Sign Language Japanese
Availability:
Freely Available
License:
<Not Specified>
Size:
2.62 TB Other
Production Status:
Existing-updated
Use:
Phonetics, Speech Recognition, Machine Translation, etc.
- Paper title: The BAS Speech Data Repository
- Paper track: Infrastructural Issues/Large Projects
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Uwe Reichel | Hungarian Academy of Sciences | HU |
| Author 2 | Florian Schiel | Bavarian Archive for Speech Signals | DE |
| Author 3 | Thomas Kisler | University of Munich | DE |
| Author 4 | Christoph Draxler | Institute of Phonetics and Speech Processing, LMU Munich | DE |
| Author 5 | Nina Pörner | University of Munich | DE |
| Main Contact | Uwe Reichel | Hungarian Academy of Sciences | None |
Documentation:
Corpus documentation (mainly in German and English) is available on the corpus landing pages
Written
Software Toolkit
Language Type:
Multilingual
Languages:
Catalan English German Spanish Italian
Availability:
Freely Available
License:
Affero GPL
Size:
2 GByte
Production Status:
Existing-updated
Use:
Tool Suite Including Many Modules
- Paper title: Coreference Resolution in FreeLing 4.0
- Paper track: Written
- Paper status: Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Montserrat Marimon | Universitat Pompeu Fabra | ES |
| Author 2 | Lluís Padró | Universitat Politècnica de Catalunya | ES |
| Author 3 | Jordi Turmo | Universitat Politècnica de Catalunya (UPC) | ES |
| Main Contact | Montserrat Marimon | Universitat Pompeu Fabra | None |
Documentation:
https://talp-upc.gitbooks.io/freeling-user-manual/
Language Type:
Multilingual
Languages:
English German Russian Spanish French
Availability:
From Owner
License:
<Not Specified>
Size:
20 semantic fields Other
Production Status:
Existing-used
Use:
Evaluation/Validation
- Paper title: Typology of Adjectives Benchmark for Compositional Distributional Models
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Daria Ryzhova | Higher School of Economics | RU |
| Author 2 | Maria Kyuseva | Higher School of Economics | RU |
| Author 3 | Denis Paperno | University of Trento | IT |
| Main Contact | Daria Ryzhova | Higher School of Economics | None |
Documentation:
no




